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RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 1Q|0^^,>^\ 



Source: OlPf. 



Date Processed by STIC: Hlt\ot/ 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EFTHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gOV or phone 703-306-4119 (R, Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gOV or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http;//www.uspto.gov/web/oflices/pac/checker 

Applicants submitting genetic sequence k(formation electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may nave been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 

United States Patent and Trademark OfRce. and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://vmw.uspto.gov/ebc/efs/downloads/docuinents,htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patcnt*and Trademark Office, Technology Center 1600, Reception Area, 7* Floor, Examiner Name, 
Sequence Lafomiation, Qystal Mall One, 191 1 South Clark Street, Arlington, VA 22202 



U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Qystal Plaza Two, 
201 1 South Clark Place, Arlington, VA 22202 
4. Federal Express, United Parcel Service, or other delivery service to! U.S. Patent and Trademark Office, 
Box Sequerfce, Room 1B03-Maihoom, Crystal Plaza Two, 201 1 South Claik Place, Arlington, VA 22202 



Or 



Revised 01/29/2002 



9 



Raw Sequence Listing Error Summary 



0 



ERROR DETECTED SUGGESTED CX)RRECTION ' SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA** HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



1 Wrapped Nudeics 
Wrapped Aminos 



The numberAext at the end of each line **wrapped*' down to the next line. This may occur if your file 
was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
prevent •Swapping." 



^Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



_Misa]igned Amino 
Numbering 

Non-ASCII 



_Variablc Length 



Patentin 2.0 
" "bug" 



_Sldpped Sequences 
(OLD RULES) 



8 Skipped Sequences 

(NEW RULES) 



10 



V to 



Use of n*s or Xaa's 
EW RULES) . 



Invalid <213> 



11 



12 



l&eof<220> 



Patentin 2.0 
••bug- 



The numbering under each 5* amino acid is misaligned Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCn(IX)S) text, as required by the Sequence Rules. Please 
ensure your subsequent submission b saved In ASCII text 

Sequence(s) ^contain n*s or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>*<223> section that some may be missing. 

A "bug" in Patentin version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences(s) " . Normally, Patentin would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino add sequence. Thb appllea to the mandatoiy <220>^<223> sections for 
Artificial or Unknown sequences.' 

Sequencc(s) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRlPnON:SEQ ID NO:X: (insert SEQ ID NO where "X'Ms shoWn) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:** response to Include the skipped sequences. 



Sequence($) _ 



_ missing. If Intentional, please insert the following lines for each skipped sequence. 



<210> sequence id number 
<400> sequence id number 
000 

Use of n*s and/or Xaa*s have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> b MANDATORY if n*s or Xaa's are present 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence I^hle^s, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial-Sequence 



Sequence(8) _ 



_ missing the <220> "Feature** and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 

"Unknown." Please explain source of genetic material in <220> to <223> sectioa 

(See "Federal Register,** 06A) 1/1998, Vol. 63, No. 104, pp. 2963 1-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk** function of Patentin version 2.0. This causes a corrupted file, . 
resuhing in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use *'File Manager" or any other manual means ta copy file to floppy disk. 



13 



Misuse of n 



n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 
any vahie not spedficalfy a nucleotide. 



AMG^yfH - Biotechnology Systems Branch - 08/21/2001 
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Does Not Comply 
Oorrectdd Diskette Needed 



OIPE 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/099,791 



DATE: 
TIME: 



04/02/2002 
15:37:47 



Input Set : 
Output Set: 



C--> 

c--> 



3 
5 
7 
9 
9 
9 
11 
13 
14 
15 
16 
18 
19 
21 
23 
25 
27 
29 
31 
33 
36 
37 
38 
39 
41 
43 
44 
46 
47 
49 
50 
52 
53 
55 
56 
58 
59 
61 
62 
64 
65 
67 
68 



A:\CEN 0245 Seq.txt ^ TtelVpeofefnws shovm exist throughout 

N:\CRF3\04022002\J09979i.raw fts Sequence Listing, r 

sequences for similar errors 

<110> APPLICANT: Heiskala, Marja 
<120> TITLE OF INVENTION: REG-LIKE PROTEIN 
<130> FILE REFERENCE: CEN02B5 

<140> CURRENT APPLICATION NUMBER: US/10/099,791 
<141> CURRENT PILING DATE: 2002-03-14 

<160> NUMBER OF SEQ ID NOS : 45 
<170> SOFTWARE: Patentin version 3.0 
<210> SEQ ID NO: 1 
<211> LENGTH: 477 
<212> TYPE: DNA 
<213> ORGANISM: human 
<400> SEQUENCE: 1 



atggcttcca gaagcatgcg gctgctccta ttgctgagct gcctggccaa aacaggagtc 
ctgggtgata tcatcatgag acccagctgt gctcctggat ggttttacca caagtccaat 
tgctatggtt acttcaggaa gctgaggaac tggtctgatg ccgagctcga gtgtcagtct 
tacggaaacg gagcccacct ggcatctatc ctgagtttaa aggaagccag caccatagca 
gagtacataa gtggctatca gagaagccag ccgatatgga ttggcctgca cgacccacag 
aagaggcagc agtggcagtg gattgatggg gccatgtatc tgtacagatc ctggtctggc 
aagtccatgg gtgggaacaa gcactgtgct gagatgagct ccaataacaa ctttttaact 
tggagcagca acgaatgcaa caagcgccaa cacttcctgt gcaagtaccg accatag 
<210> SEQ ID NO: 2 
<211> LENGTH: 158 
<212> TYPE: PRT 
<213> ORGANISM: human 
<400> SEQUENCE: 2 

Met Ala Ser Arg Ser Met Arg Leu Leu Leu Leu Leu Ser Cys Leu Ala 

15 10 15 

Lys Thr Gly Val Leu Gly Asp lie lie Met Arg Pro Ser Cys Ala Pro 



60 
120 
180 
240 
300 
360 
420 
477 



20 

Tyr His 



Lys 



Gly Trp Phe 
35 

Arg Asn Trp Ser Asp Ala 
50 

Ala His Leu Ala Ser He 
65 70 



25 30 
Ser Asn Cys Tyr Gly Tyr Phe Arg Lys Leu 

40 45 
Glu Leu Glu Cys Gin Ser Tyr Gly Asn Gly 
55 60 

Leu Ser Leu Lys Glu Ala Ser Thr He Ala 
75 80 



Glu Tyr He Ser Gly Tyr Gin Arg Ser Gin Pro He Trp He Gly Leu 

85 90 95 

His Asp Pro Gin Lys Arg Gin Gin Trp Gin Trp He Asp Gly Ala Met 

100 105 110 

Tyr Leu Tyr Arg Ser Trp Ser Gly Lys Ser Met Gly Gly Asn Lys His 

115 120 125 

Cys Ala Glu Met Ser Ser Asn Asn Asn Phe Leu Thr Trp Ser Ser Asn 
130 135 140 



file://C:\Crf3\Outhold\VsrJ099791 .htm 4/2/02 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/099,791. 



DATE: 04/02/2002 
TIME: 15:37:47 



Input Set : A:\CEN 0245 Seq.txt 

Output Set: N:\CRF3\04022002\J099791.raw 



70 
71 



Glu Cys Asn Lys Arg Gin His Phe Leu Cys Lys Tyr Arg Pro 
145 150 155 



73 <210> SEQ ID NO: 3 

74 <211> LENGTH: 78 

75 <212> TYPE: DNA 

76 <213> ORGANISM: human 

78 <400> SEQUENCE: 3 

79 atggcttcca gaagcatgcg gctgctccta ttgctgagct gcctggccaa aacaggagtc 60 
81 ctgggtgata tcatcatg 78 

84 <210> SEQ ID NO: 4 

85 <211> LENGTH: 26 

86 <212> TYPE: PRT 

87 <213> ORGANISM: human 
89 <400> SEQUENCE: 4 

91 Met Ala Ser Arg Ser Met Arg Leu Leu Leu Leu Leu Ser Cys Leu Ala 

92 1 5 10 15 

94 Lys Thr Gly Val Leu Gly Asp He He Met 

95 20 25 

97 <210> SEQ ID NO: 5 

98 <211> LENGTH: 17 

99 <212> TYPE: PRT 

100 <213> ORGANISM: human 
102 <400> SEQUENCE: 5 

104 Cys Ala Glu Met Ser Ser Asn Asn Asn Phe Leu Thr Trp Ser Ser Asn 

105 1 5 10 15 

107 Glu 

no <210> SEQ ID NO: 6 

111 <211> LENGTH: 25 

112 <212> TYPE: PRT 

113 <213> ORGANISM: human 
115 <400> SEQUENCE: 6 

117 Cys Tyr Gly Tyr Phe Arg Lys Leu Arg Asn Trp Ser Asp Ala Glu Leu 

118 1 5 10 15 

119 Glu Cys Gin Ser Tyr Gly Asn Gly Ala 

120 20 25 

122 <210> SEQ ID NO: 7 

123 <211> LENGTH: 23 

124 <212> TYPE: PRT 

125 <213> ORGANISM: human 
127 <400> SEQUENCE: 7 

129 Trp He Asp Gly Ala Met Tyr Leu Tyr Arg Ser Trp Ser Gly Lys Ser 

130 15 10 15 

132 Met Gly Gly Asn Lys His Cys 

133 20 

13 5 <210> SEQ ID NO: 8 

136 <211> LENGTH: 17 

137 <212> TYPE: PRT 

138 <213> ORGANISM: human 
140 <400> SEQUENCE: 8 



file://C:\Crf3\Outhold\VsrJ099791.htm 
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RAW SEQUENCE LISTING DATE: 04/02/2002 

PATENT APPLICATION: US/IO/099, 791 TIME: 15:37:47 

Input Set : A:\CEN 0245 Seq.txt 

Output Set: N:\CRF3\04022002\J099791.raw 

142 Cys Ala Glu Met Ser Ser Asn Asn Asn Phe Leu Thr Trp Ser Ser Asn 

143 1 5 10 15 

145 Glu 

148 <210> SEQ ID NO: 9 

149 <211> LENGTH: 29 

150 <212> TYPE: PRT 

151 <213> ORGANISM: human 
153 <400> SEQUENCE: 9 

155 Cys Ala Glu Met Ser Ser Asn Asn Asn Phe Leu Thr Trp Ser Ser Asn 

156 1 5 10 15 

158 Glu Cys Asn Lys Arg Gin His Phe Leu Cys Lys Tyr Arg 

159 20 25 

161 <210> SEQ ID NO: 10 • 

162 <211> LENGTH: 27 

163 <212> TYPE: PRT 

164 <213> ORGANISM: human 
166 <400> SEQUENCE: 10 

168 Cys Glu Tyr lie Ser Gly Tyr Gin Arg Ser Gin Pro lie Trp lie Gly 

169 15 10 15 

171 Leu His Asp Pro Gin Lys Arg Gin Gin Trp Gin 

172 20 25 

174 <210> SEQ ID NO: 11 

175 <211> LENGTH: 23 

176 <212> TYPE: PRT 

177 <213> ORGANISM: human 
179 <400> SEQUENCE: 11 

181 Cys Gin Ser Tyr Gly Asn Gly Ala His Leu Ala Ser lie Leu Ser Leu 

182 15 10 15 

184 Lys Glu Ala Ser Thr lie Ala 

185 20 

187 <210> SEQ ID NO: 12 

188 <211> LENGTH: 20 v , . | i 

189 <212> TYPE: DNA ^ -^A\i^td Y^pOV^Sf^ e/VY^rv-5tA.tf« SMif 

190 <213> ORGANIsnTC ^nthetic constructll^ it^/wxlO 

192 <400> SEQUENCE: 12 

193 cagctgtgct cctggatggt 20 

196 <210> SEQ ID NO: 13 

197 <211> LENGTH: 20 

198 <212> TYPE: DNA^^^^ 

199 <213> ORGANISM :X ^thet ic construc t^"^ 

201 <400> SEQUENCE: IS*""^ 

202 tggtcggtac ttgcacagga 20 

205 <210> SEQ ID NO: 14 

206 <211> LENGTH: 20 

207 <212> TYPE: DNA^ 

208 <213> ORGANISM :''^>nthetic construct 

210 <400> SEQUENCE: JM^ \. 

211 ctcctattgc tgagctgcct 20 
214 <210> SEQ ID NO: 15 



file://C:\Crf3\Outhold\VsrJ099791 ,htm 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/099,791 



DATE: 04/02/2002 
TIME: 15:37:47 



Input Set : A:\CEN 0245 Seq.txt 

Output Set: N:\CRF3\04022002\J099791.raw 



215 <211> LENGTH: 20 

216 <212> TYPE: DNA 

217 <213> ORGANISM^ synthe tic 
219 <400> SEQUElfeE-r 



construct 



"15 

220 attcgttgct gctccaagtt 

223 <210> SEQ ID NO: 16 

224 <211> LENGTH: 19 

225 <212> TYPE: DNA 

226 <213> ORGANISM: 

228 <400> SEQUENCE: 1 

229 ttccagaagc atgcggctg 

232 <210> SEQ ID NO: 17 

233 <211> LENGTH: 19 

234 <212> TYPE: DNA ^--'^^ 

235 <213> ORGANISM: (synthetic 

237 <400> SEQUENCE: 

238 acaggaagtg ttggcgctt 

241 <210> SEQ ID NO: 18 

242 <211> LENGTH: 19 

243 <212> TYPE: DNA 

244 <213> ORGANISM :^ynthetic 

246 <400> SEQUENCE: 1 

247 atggcttcca gaagcatgc 

250 <210> SEQ ID NO: 19 

251 <211> LENGTH: 20 

252 <212> TYPE: DNA 

253 <213> ORGANISM: ^nthetic 

255 <400> SEQUENCE: 

256 ctatggtcgg tacttgcaca 

258 <210> SEQ ID NO: 20 

259 <211> LENGTH: 20 

260 <212> TYPE: DNA 

261 <213> ORGANISM:-^'^^arthetic 

263 <400> SEQUENCE: 20 

264 cttgctctat ggtcggtact 

267 <210> SEQ ID NO: 21 

268 <211> LENGTH: 21 

269 <212> TYPE: DNA 

270 <213> ORGANISM :^(^s^nthetic 
272 <400> SEQUENCE:' 




20 




19 



19 




19 




20 



construct 



20 




273 actgggacca ctggagacac t 
276 <210> SEQ ID NO: 22 
<211> LENGTH: 19 
<212> TYPE: DNA^ 



21 



277 
278 



.281 <400> SEQUENCEs^_22 



f 



282 gagacactga agaaggcag 

285 <210> SEQ ID NO: 23 

286 <211> LENGTH: 20 




19 



file://C:\Crf3\Outhold\VsrJ099791.htm 
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RAW SEQUENCE LISTING DATE: 04/02/2002 

PATENT APPLICATION: US/10/099,791 TIME: 15:37:47 

Input Set : A:\CEN 0245 Seq.txt 

Output Set: N:\CRF3\04022002\J099791.raw 



287 <212> TYPE: DNA 

288 <213> ORGANISM: 

290 <400> SEQUENCE: 

291 agacccagct gtttcafagg^ 

294 <210> SEQ ID NO: 24 

295 <211> LENGTH: 20 

296 <212> TYPE: mjA-- 

297 <213> ORGANISM: synthetic 

299 <400> SEQUEN<^: 

300 aatggagaga gggtr 

303 <210> SEQ ID NO 

304 <211> LENGTH: 2 

305 <212> TYPE: DN. 

306 <213> ORGANISM:^ 

308 <400> SEQUENCE: 

309 tgatatcatc atgagaccca get 

312 <210> SEQ ID NO: 26 

313 <211> LENGTH: 21 

314 <212> TYPE: DNA 

315 <213> ORGANISM :( synthetic 

317 <400> SEQUENCE: 

318 agacagtcat ccatttgccc a 

321 <210> SEQ ID NO: 27 

322 <211> LENGTH: 21 

323 <212> TYPE: DNA 

324 <213> ORGANISM: Synthetic 

326 <400> SEQUENCE: 2 

327 tgggcaaatg gatgactgtc t 

330 <210> SEQ ID NO: 28 

331 <211> LENGTH: 21 

332 <212> TYPE: DNA^--^^^^- 

333 <213> organism/^ synthetic 

335 <400> SEQUENCE :V28^ 

336 ctctagaatc caacaaaactr~-c — 

339 <210> SEQ ID NO: 29 

340 <211> LENGTH: 21 

341 <212> TYPE: DNA 

342 <213> ORGANISM :[ syntlieti^c 
344 <400> SEQUENCE :\^ 
34 5 tgccagacca ggatctgtac^ 

348 <210> SEQ ID NO: 30 

349 <211> LENGTH: 19 

350 <212> TYPE: DN 

351 <213> organism!^ synthetic 

353 <400> SEQUENCE: 

354 atccatatcg gctggcttc 

357 <210> SEQ ID NO: 31 

358 <211> LENGTH: 20 

359 <212> TYPE: DNA 




20 




20 



23 




21 




21 




21 




21 
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VERIFICATION SUMMARY DATE: 04/02/2002 

PATENT APPLICATION: US/10/099,791 TIME: 15:37:48 

Input Set : A:\CEN 0245 Seq.txt 

Output set: N:\CRF3\04022002\J099791.raw 

L:9 M:270 C: Current Application Number differs, Replaced Current Application No 
L:9 M:271 C: Current Filing Date differs, Replaced Current Filing Date 



file://C:\Crf3\OutholchVsrJ099791.htin 



